A general instrumental variable framework for regression analysis with outcome missing not at random.

Authors

  • Eric J Tchetgen Tchetgen
  • Kathleen E Wirth
Abstract

The instrumental variable (IV) design is a well-known approach for unbiased evaluation of causal effects in the presence of unobserved confounding. In this article, we study the IV approach to account for selection bias in regression analysis with outcome missing not at random. In such a setting, a valid IV is a variable which (i) predicts the nonresponse process, and (ii) is independent of the outcome in the underlying population. We show that under the additional assumption (iii) that the IV is independent of the magnitude of selection bias due to nonresponse, the population regression in view is nonparametrically identified. For point estimation under (i)-(iii), we propose a simple complete-case analysis which modifies the regression of primary interest by carefully incorporating the IV to account for selection bias. The approach is developed for the identity, log and logit link functions. For inferences about the marginal mean of a binary outcome assuming (i) and (ii) only, we describe novel and approximately sharp bounds which, unlike Robins-Manski bounds, are smooth in model parameters, therefore allowing for a straightforward approach to account for uncertainty due to sampling variability. These bounds provide a more honest account of uncertainty and allow one to assess the extent to which a violation of the key identifying condition (iii) might affect inferences. For illustration, the methods are used to account for selection bias induced by HIV testing nonparticipation in the evaluation of HIV prevalence in the Zambian Demographic and Health Surveys.
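As a rough illustration of the comparison point mentioned in the abstract, the following Python sketch computes classical worst-case bounds on the mean of a binary outcome with missing values, and then intersects them across levels of an instrument under assumption (ii). It is not the authors' proposed smooth bounds or their code. It assumes that "Robins-Manski bounds" refers to these worst-case intersection bounds, and the function names `worst_case_bounds` and `iv_intersection_bounds`, along with the toy data, are hypothetical. The `max` and `min` across instrument levels are one plausible reason such bounds are not smooth in the underlying proportions.

```python
from typing import Hashable, Optional, Sequence, Tuple


def worst_case_bounds(outcomes: Sequence[Optional[int]]) -> Tuple[float, float]:
    """Worst-case bounds on the mean of a binary outcome.

    Missing outcomes are encoded as None. The lower bound sets every
    missing outcome to 0, the upper bound sets every missing outcome to 1.
    """
    n = len(outcomes)
    if n == 0:
        raise ValueError("need at least one unit")
    observed = [y for y in outcomes if y is not None]
    if any(y not in (0, 1) for y in observed):
        raise ValueError("observed outcomes must be 0 or 1")
    n_missing = n - len(observed)
    lower = sum(observed) / n
    upper = (sum(observed) + n_missing) / n
    return lower, upper


def iv_intersection_bounds(
    outcomes: Sequence[Optional[int]],
    instrument: Sequence[Hashable],
) -> Tuple[float, float]:
    """Intersect worst-case bounds across instrument levels.

    Assumes the instrument is independent of the outcome (assumption (ii)),
    so the bounds within every instrument level cover the same population
    mean and can be intersected.
    """
    if len(outcomes) != len(instrument):
        raise ValueError("outcomes and instrument must have equal length")
    by_level = {}
    for y, z in zip(outcomes, instrument):
        by_level.setdefault(z, []).append(y)
    per_level = [worst_case_bounds(ys) for ys in by_level.values()]
    lower = max(lo for lo, _ in per_level)  # non-smooth: max over levels
    upper = min(hi for _, hi in per_level)  # non-smooth: min over levels
    return lower, upper


# Hypothetical toy data: outcome (None = nonresponse) and a binary instrument.
y = [1, 0, None, None, 1, 0, 0, None]
z = [0, 0, 0, 0, 1, 1, 1, 1]
print(worst_case_bounds(y))
print(iv_intersection_bounds(y, z))
```

In this toy example the instrument level with less nonresponse yields the tighter upper bound, which is how a variable that predicts nonresponse (assumption (i)) can narrow the interval without any assumption about the size of the selection bias.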

Similar resources

Exploiting instrumental variables in causal inference with nonignorable outcome nonresponse using principal stratification

In this paper we consider a specific post-treatment complication that may arise in both randomized and observational studies, namely the problem of nonignorable nonresponse on an outcome variable. This is a typical topic usually known in the econometric literature as endogenous selection; here we tackle this problem specifically within a causal inference framework. By exploiting Principal Strat...

Maximum likelihood analysis of the logistic regression model when predictor variable data are incomplete but auxiliary variables are available

Background and Objectives: Missing data exist in many studies, e.g. in regression models, and they decrease the model's efficacy. Many methods have been suggested for handling incomplete data; they have generally focused on missing outcome values. But covariate values can also be missing. Materials and Methods: In this paper we study the missing imputation by the EM algorithm and auxiliary varia...

Allowing for missing outcome data and incomplete uptake of randomised interventions, with application to an Internet-based alcohol trial

Missing outcome data and incomplete uptake of randomised interventions are common problems, which complicate the analysis and interpretation of randomised controlled trials, and are rarely addressed well in practice. To promote the implementation of recent methodological developments, we describe sequences of randomisation-based analyses that can be used to explore both issues. We illustrate th...

A Non-Random Dropout Model for Analyzing Longitudinal Skew-Normal Response

In this paper, multivariate skew-normal distribution is employed for analyzing an outcome based dropout model for repeated measurements with non-random dropout in skew regression data sets. A probit regression is considered as the conditional probability of an observation to be missing given outcomes. A simulation study of using the proposed methodology and comparing it with a semi-parame...

Comparing the methodology of nested case-control and cohort analysis on county tuberculosis data: an experience

Background and objective: The nested case-control study has become popular as an efficient alternative to the full-cohort design. This study compares the results of a nested case-control analysis approach with the full cohort analysis. Methods: A cohort of 276 subjects (new cases from a TB registry) was used for this study. A Cox regression model was used for the full cohort analysis. In orde...

Journal title:
  • Biometrics

Volume 73, Issue 4

Pages: -

Publication date: 2017